Corpus: deu-ch_newscrawl_2012_1M

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 56041 S-
2 34255 B-
3 32903 A-
4 30005 M-
5 26469 K-
Top Character Bigrams
word rank frequency n-gram
1 11732 St-
2 11664 Sc-
3 10502 Ge-
4 9111 Be-
5 8933 Ma-
Top Character Trigrams
word rank frequency n-gram
1 11227 Sch-
2 6913 Ver-
3 5206 ver-
4 3860 Sta-
5 3082 Mar-
Top Character 4-Grams
word rank frequency n-gram
1 2160 Schw-
2 1698 Inte-
3 1618 Schu-
4 1558 Schl-
5 1447 Unte-
Top Character 5-Grams
word rank frequency n-gram
1 1436 Unter-
2 1429 Inter-
3 1150 Schwe-
4 1132 Gesch-
5 1076 Schul-
7198 msec needed at 2018-11-26 16:39